Solving Electromagnetic Scattering Problems With Tens of Billions of Unknowns Using GPU Accelerated Massively Parallel MLFMA
نویسندگان
چکیده
In this article, a massively parallel approach of the multilevel fast multipole algorithm (PMLFMA) on graphics processing unit (GPU) heterogeneous platform, noted as GPU-PMLFMA, is presented for solving extremely large electromagnetic scattering problems involving tens billions unknowns, approach, flexible and efficient ternary partitioning scheme employed at first to partition MLFMA octree among message-passing interface (MPI) processes. Then, computationally intensive parts PMLFMA each MPI process, matrix filling, aggregation disaggregation, so are accelerated by using GPU. Different parallelization strategies in coincidence with designed GPU ensure high computational throughput. Special memory usage strategy improve efficiency benefit data reusing. The CPU/GPU asynchronous computing pattern OpenMP compute unified device architecture (CUDA), respectively, accelerating CPU execution computation time overlapped. architecture-based optimization implemented further efficiency. Numerical results demonstrate that proposed GPU-PMLFMA can achieve over three times speedup, compared eight-threaded conventional PMLFMA. Solutions electrically complicated objects about 24 000 wavelengths 41.8 billion unknowns presented.
منابع مشابه
Solutions of Extremely Large Electromagnetics Problems Involving Tens of Millions of Unknowns Using Parallel MLFMA and Preconditioners
We present fast and accurate solutions of very large electromagnetics problems discretized with tens of millions of unknowns. Scattering and radiation problems involving three-dimensional large-scale objects are formulated accurately with surface integral equations. The resulting dense matrix equations are solved iteratively by using a parallel implementation of the multilevel fast multipole al...
متن کاملSolving Large Regression Problems using an Ensemble of GPU-accelerated ELMs
This paper presents an approach that allows for performing regression on large data sets in reasonable time. The main component of the approach consists in speeding up the slowest operation of the used algorithm by running it on the Graphics Processing Unit (GPU) of the video card, instead of the processor (CPU). The experiments show a speedup of an order of magnitude by using the GPU, and comp...
متن کاملTask parallel implementation of a solver for electromagnetic scattering problems
Electromagnetic computations, where the wavelength is small in relation to the geometry of interest, become computationally demanding. In order to manage computations for realistic problems like electromagnetic scattering from aircraft, the use of parallel computing is essential. In this paper, we describe how a solver based on a hierarchical nested equivalent source approximation can be implem...
متن کاملParallel Data Partitioning Strategy in Solving Large Scale Electromagnetic Scattering Problems
The multilevel fast multipole algorithm (MLFMA) has shown great efficiency in solving large scale electromagnetic scattering problems. However, when unknowns become up to tens of millions, it is not trivial to keep high performance because of the complicated structure and calculation of MLFMA. In order to get rid of the bottleneck caused by load balancing, a parallel data partitioning strategy ...
متن کاملA massively parallel GPU-accelerated model for analysis of fully nonlinear free surface waves
We implement and evaluate a massively parallel and scalable algorithm based on a multigrid preconditioned Defect Correction method for the simulation of fully nonlinear free surface flows. The simulations are based on a potential model that describes wave propagation over uneven bottoms in three space dimensions and is useful for fast analysis and prediction purposes in coastal and offshore eng...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Antennas and Propagation
سال: 2022
ISSN: ['1558-2221', '0018-926X']
DOI: https://doi.org/10.1109/tap.2022.3161520